# LibriSpeech optimized

Assignment1 Jack
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
A
Classroom-workshop
24
0
Assignment1 Jane
MIT
s2t-small-librispeech-asr is a speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture.
Speech Recognition Transformers English
A
Classroom-workshop
29
0
Wav2vec2 Large 960h Lv60 Self 4 Gram
Apache-2.0
Based on Facebook's Wav2Vec2-Large-960h-lv60-self model, enhanced with an English 4-gram language model to improve speech recognition accuracy
Speech Recognition English
W
patrickvonplaten
22
4
Wav2vec2 Base 960h 4 Gram
Apache-2.0
Based on Facebook's Wav2Vec2-Base-960h model, with an added English 4-gram language model to improve automatic speech recognition (ASR) accuracy.
Speech Recognition Transformers English
W
patrickvonplaten
19
0
Wav2vec2 2 Bart Large No Adapter
This model is an automatic speech recognition (ASR) model trained on the LibriSpeech ASR dataset, capable of converting English speech into text.
Speech Recognition Transformers
W
sanchit-gandhi
22
0
S2t Medium Librispeech Asr
MIT
A speech-to-text (S2T) model for automatic speech recognition (ASR), based on a sequence-to-sequence transformer architecture
Speech Recognition Transformers English
S
facebook
1,086
9
Wavlm Libri Clean 100h Base Plus
An automatic speech recognition model fine-tuned on the LIBRISPEECH_ASR - CLEAN dataset based on microsoft/wavlm-base-plus
Speech Recognition Transformers
W
patrickvonplaten
126.17k
3
Wav2vec2 Base 960h
Apache-2.0
Wav2Vec2 is a self-supervised learning-based speech recognition model developed by Facebook, trained on the LibriSpeech dataset, supporting English speech-to-text tasks.
Speech Recognition Transformers English
W
tommy19970714
19
0
Wav2vec2 2 Bert Large No Adapter
An automatic speech recognition (ASR) model trained on the LibriSpeech dataset for converting English speech to text
Speech Recognition Transformers
W
speech-seq2seq
15
1
S2t Large Librispeech Asr
MIT
An end-to-end sequence-to-sequence transformer model for automatic speech recognition (ASR), trained on the LibriSpeech dataset
Speech Recognition Transformers English
S
facebook
422
10
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase